WORLD INTELLECTUAL PROPERTY ORGANIZATION 
Intemational Bureau 




PCX 

INTERNATIONAL APPLICAtldNl>llBUSHED UNDM THE PATENT COOPERATION'TREATY (PCT)" 



(51) International Patent Classiflcatioii ^ : 
G06F3/06 



Al 



(11) International Publication Number: 
(43) Intemational Publication Date: 



WO 99^8067 

29 July 1999(29.07.99) 



(21) Intemational Apptication Number: PCT/U$99/01282 

(22) Intemational Filing Date: 20 January 1999 (20.01 .99) 



(30) Priority Data: 
09/009.945 



21 January 1998(21.01.98) 



US 



(71) Applicant: MYLEX CORPORATION [US/US]; 34551 Arden- 

wood Boulevaixl, Fremont. CA 94555 (US). 

(72) Inventors: SURUGGUCMI, Krishnakumar. Rao; 32875 Blue- 

bird Court, FrcmtMit, CA 94555 (US). KAILASH; 165 Park 
Ellen Drive, San Jose. CA 95136 (US). 

(74) Agents: ANANIAN. Michael. R. et al.; Flchr. Hohbach, 
Test. Albritton & Herbert LLP, Suite 3400, 4 Embarcadero 
Center. San Francisco, CA 941 1 1-4187 (US). 



(81) Deagnated States: AL. AM. AT, AU. AZ. BA. BB, BG, BR. 
BY. CA. CH. CN, CU. CZ. DE. DK. EE. ES. FI. GB, GD. 
GE. GH, GM, HR, HU. ID, IL. IN, IS, JP, KE. KG, KP, 
KR, KZ, LC. LK, Ut. LS, LT. LU. LV, MD, MG. MK. 
MN, MW, MX, NO, NZ, PL. PT, RO, RU, SD, SE, SG. 
SI, SK, SL. TJ. TM, TR. TT. UA. UG, UZ, VN, YU. ZW. 
ARIPO patent (GH. GM, KE, LS, MW, SD, SZ, UG. ZW), 
Eurasian patent (AM. AZ, BY, KG, KZ. MD. RU, TJ. TM), 
European patent (AT, BE, CH, CY, DE, DK, ES, FI, FR. 
GB, GR, IE, FT, LU, MC, NL. PT. SE), OAPI patent (BF, 
BJ. CF. CG, a. CM. GA, GN, GW, ML. MR, NE, SN. 
TD.TG). 



Published 

With intemational search report. 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) TlUe: AN APPARATUS AND METHOD FOR AUTOMATIC CONHGURATION OF A RAID CONTROLLER 



rtOSI oysXBnl 

100 

1 tnn* 

nosi 

108 -X. PfiflDhefsi 


102-^ 

RMD 
Contpolter 


SCStCharmdo 108A 




rtOSl 
CPU/ 
Memofy 


Bis 104 


f=] ... mpfi 

110«^ HON-' 
. • • 

SCSIChanneVi 106N 

4 Z 






... ^ ... irUi 



(57) Abstract 

The present inventicm peitains to an ap^^ for automatically-configuring disk drivcs connected.to.a RAID controller . 

The automatic configuration mechanism is able to generate a fiill configuration of die disk drives connected to a RAID controller both at 
system initialization or bootap and at nmtimc. The mechanism uses a robust criteria to configure tfie disk drives which allows the drives to 
be configured in accordance with one or more RAID levels and which considers any existing configurations. The automatic configuration 
mechanism is advantageous since it eliminates user interaction, time, and knowledge often lequued to configure disk drives connected to a 
RAID controller. 
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AN APPARATUS AND METHOD FOR AUTOMATIC CONFIGURATION 

OF A RAID CONTROLLER 

Brief DescriptioTi of the Invention 

The present mvention relates generally to peripheral controllers. More 
particularly, the invention relates to the automatic configuration of Redundant Array 
of Independent Disks (RAID) controllers. 

5 

Background of the TnventioTi 

RAID is a technology used to improve the I/O performance and reliability of 
mass storage devices. Data is stored across multiple disks in order to provide 
immediate access to the data despite one or more disk failures. The RAID 
1 0 technology is typically associated with a taxomony of techniques, where each 
technique is referred to by a RAID level. There are six basic RAID levels, each 
having its own benefits and disadvantages. RAID level 2 uses non-standard disks 
and as such is not commercially feasible. 

15 RAID level 0 employs "striping" where the data is broken into a number of 

stripes which are stored across the disks in the array. This technique provides higher 
performance in accessing the data but provides no redundancy which is needed for 
disk failures. 

20 RAID level 1 employs ^'mirroring" where each unit of data is diq)licated or 

'*mim)red" onto another disk drive. Mirroring requires two or more disk drives. For 

read operationsTthis teclmique is adva^ 

performed in parallel. A drawback with mirroring is that it achieves a storage 
efiBciency of only 50%. 

25 
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In RAID level 3, a data block is partitioned into stripes which are striped 
across a set of drives. A separate parity drive is used to store the parity bytes 
associated with the data block. The parity is used for data redundancy . Data can be 
regenerated when there is a single drive failure firom the data on the remaining drives 
5 and the parity drive. This type of data management is advantageous since it requires 
less space tiian mirroring and only a single parity drive. In addition, the data is 
accessed in parallel from each drive which is beneficial for large file transfers. 
However, performance is poor for high I/O transaction applications since it requires 
access to each drive in the array. 

10 

In RAID level 4, an entire data block is written to a disk drive. Parity for 
each data block is stored on a siogle parity drive. Since each disk is accessed 
independently, this technique is beneficial for high I/O transaction applications. A 
drawback with this technique is the single parity disk which becomes a bottleneck 
1 5 since the single parity drive needs to be accessed for each write operation. This is 
especially burdensome when there are a number of small I/O operations scattered 
randomly across the disks in the array. 



In RAID level 5, a data block is partitioned into stripes which are striped 
20 across the disk drives. Parity for the data blocks is distributed across the drives 

thereby reducing ttie bottleneck inherent to level 4 which stores the parity on a single 
disk drive. This technique offers fast throughput for smaU data files but performs 
poorly for large data files. 

25 A typical data storage system can contain a nimiber of disk storage devices 

that can be arranged in accordance with one or more RAID levels, A RAID 
controller is a device that is used to manage one or more arrays of RAID disk drives. 

The RAID"ocmtrollCT is responsible for coi^ 

storage system into logical drives where each logical drive is managed in accordance 
30 with one of the RAID levels. 
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RAID controllers are complex and difficult to configure. This is due in part 
to the numerous possible configurations that can be achieved, the knowledge 
required by a user to configure such a system, and the time consumed by a user in 
configuring the controller. In one such RAID controller configuration procedure, an 
5 automatic configuration feature is provided that attempts to alleviate the user's input 
by automatically configuring a nxmiber of devices at system initialization. However, 
this automatic configuration feature is very limited and only operates where all the 
physical disk drives are of the same physical size and where there are between 3 to 8 
disk drives. In this case, the automatic configuration feature configures the disk 
1 0 drives as a single drive group defined as a RAID level 5 system drive with no spare 
drives. This configuration is limited providing no other alternate configurations. 

Accordingly, there exists a need for an automatic RAID controller 
configuration mechanism that can accommodate various types of RAID level 
1 5 configurations and for disk drives having various physical dimensions. 



Summary of the Invention 

The present invention pertains to an apparatus and method for automatically 
configuring disk drives cormected to a RAID controller. The automatic 
20 configuration mechanism is able to gen^ate a full configuration of the disk drives 
coimected to a RAID controller both at system initialization or bootup and at 
runtime. The mechanism uses a robust criteria to configure the disk drives which 
allows the drives to be configured in accordance with one or more RAID levels and 
with various default settings that affect the operation of the disk array. 

25 

In a preferred embodiment, the automatic configuration mechanism includes 
a startup configuration procedure that provides the automatic configuration 

capabiU^^^^tOTfinitialization and a runtime coiifiguration pro 

automatically configures disk drives connected to the RAID controller at runtime. 
30 The startup configuration procedure generates a full configuration of the disk drives. 
The configuration specifies the logical drives that are formed and the associated 
operational characteristics for each logical drive which includes the RAID level, the 
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capacity, as well as other information. The startup configuration procedure can 
accommodate previously existing configurations and partial configurations. In 
addition, the startup configuration procedure can configure unconfigured drives in 
accordance with a criteria that considers the existing configuration of the disk drives 
5 and which is able to select an qjpropriate RAID level suitable for optimizing the 
overall computer system's performance. 

The runtime configuration procedure is used to configure disk drives 
connected to the RAID controller while the system is operational. The inserted disk 

10 drives can be part of an existing configuration or can be imconfigured. The runtime 
configuration procedure can incorporate the configured drives into the current 
configuration as well as configure the unconfigured drives. The unconfigured drives 
are configured in accordance with a criteria that uses the inserted disk drives to 
replace dead or failed drives, that adds the inserted disk drives to certain logical 

1 S drives that can support the additional capacity at the defined RAID level, and that 
forms additional logical drives as needed. 

The automatic configuration mechanism is advantageous since it eliminates 
user interaction required to configure disk drives connected to a RAID controller. In 
20 addition, the mechanism allows disk drives to be configured into one or more RAID 
levels in a maimer that considers the current state of the disk drives and that 
optimizes the ovCTall system performance. The mechanism is flexible performing 
the automatic configuration both at runtime and at system iiutialization. 



25 Brief Description of the Drawing s 

For a better understanding of the nature and objects of the invention, 
reference should be made to the following detailed description taken in conjunction 

wrtiTthe^ixompa^ingl^ 

FIGS. 1 A-IB illustrates a computer system in accordance with the preferred 
30 ^bodiments of the present invention. 

FIG. 2 illustrates a RAID controller in accordance with a preferred 
embodiment of the present invention. 
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FIG. 3 is a flow chart illustrating the steps used to manually configure a set 
of disk drives. 

FIGS. 4-5 illustrate the process of forming and ordering drive groiq>s fi:om 
physical drives in a preferred raibodiment of the present invention. 
5 FIG. 6 illustrates an exemplary assignment of RAID levels to a set of logical 

drives in accordance with a preferred embodiment of the present invention. 

FIG. 7 is a flow chart illustrating the steps used in the startup configuration 
procedure in a preferred embodiment of the present invention. 

FIG. 8 is a flow chart illustrating the steps used to scan the physical devices 
1 0 connected to the RAID controller in a preferred embodiment of the present 
invention. 

FIG. 9 is a flow chart illustrating the logical drive order rules of a preferred 
embodiment of the present invention. 

FIG. 10 is a flow chart illustrating the steps used in the runtime configuration 
1 5 procedure in a preferred embodiment of the present invention. 

FIG. 11 is a flow chart illustrating the steps used to add capacity in 
accordance with a preferred embodiment of the present invention. 

Like reference numerals refer to corresponding parts throughout die several 
views of the drawings. 

20 

Detailed Description of the Invention 

Fig. 1 A illustrates a host system 100 utilizing the RAID controller 102 in a 
first preferred embodiment of the present inventioiL There is shown the RAID 
controller 102 connected to a host peripheral bus 104 and one or more Small 
25 Computer Systan Interface (SCSI) channels 1 06A-1 06N. In a preferred 

embodiment, the RAID controller 102 can be any of the Mylex™ RAID controllers, 
such as but not limited to the DAC960 series of RAID controllers. The operation of 
SCSI chamels^isweUkodAvnin the art a^^ 

in Ancot Corporation, Basics of SCSI, third edition, (1992-1996), which is hereby 
30 incorporated by reference. 
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The host peripheral bus 104 is connected to a host central processing unit 
(CPU) and memory 108. The host peripheral bus 104 can be any type of peripheral 
bus including but not hmited the Peripheral Component Interconnect (PCI) bus. 
Industry Standard Architecture (ISA) bus. Extended Industry Standard Architecture 
5 (EISA) bus. Micro Channel Architecture, and the like. The host CPU and memory 
108 includes an operating system (not shown) that interacts with the RAID 
controller 102. 

Each SCSI channel 106 contains one or more peripheral devices 1 lOA-1 lOZ 
10 such as but not limited to disk drives, tape drives, various types of optical disk 
drives, printers, scanners, processors, communication devices, medium changers, 
and the hke. A SCSI channel 106 A can be used to access peripheral devices located 
within the host system 100 or a SCSI channel 106N can be used to access peripheral 
devices external to the host system 100. 

15 

Fig. IB illustrates a computer system in accordance with a second preferred 
embodiment of the present invention. In this embodiment, the RAID controller 102 
is external to the host system 100. The RAID controller 102 is connected to the host 
system 100 througji a SCSI channel 106A and is connected to one or more 
20 peripheral devices through one or more SCSI chaimels 1 06B- 1 06N. The RAID 
controller 102 and the SCSI channels 106 are similar to what was described above 
with respect to Fig. 1 A, 

Fig. 2 illustrates the components of the RAID controller 102. There is shown 
25 a CPU 1 12 connected to the host peripheral bus 104. The CPU 1 12 is also 
cormected to a secondary peripheral bus 114 coupled to one or more SCSI I/O 
processors 1 16A-1 16N. A SCSI I/O processor 116 can be coupled to a SCSI 
chaimen06A^^d1acts^a5rinterf^^ — 
the SCSI channel 106. The CPU 112 is also coi4)led to a local bus 118 connected to 
30 a first memory device (memory,) 120, a second memory device (memoryj ) 122, and 
a coprocessor 124. The coprocessor 124 is coiq)Ied to an on-board cache memory 
126 which is imder the control of the coprocessor 124. The coprocessor 124 and 
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cache memory 126 is used to retrieve data read to and written from the peripheral 
devices 1 10 as well as perform error correction code (ECC) encoding and decoding 
on data that is read to and from the peripheral devices 110. The cache memory 126 
can employ either a write-through or write-back caching strategy. 

5 

In a preferred embodiment, the CPU 1 12 is a 32-bit Intel i960 RISC 
microprocessor, the first memory device 120 is a flash erasable/programmable read 
only memory (EPROM), the second memory device 122 is a non-volatile random 
access memory (NVRAM), the host peripheral bxis 104 is a primary PCI bus, and the 

10 second peripheral bus 1 14 is a secondary PCI bus. In the first memory device 120, 
there can be stored a startup configuration procedure 128 and a runtime 
configuration procedure 130. The startup configuration procedure 128 is used to 
automatically configure the disk drives at system initialization or bootiq) which 
occurs before the operating system is installed. The runtime configuration procedure 

15 128 is used to configure the disk drives while the operating system is operational. 
In the second memory device 122, there can be stored a configuration file 132 
containing the current configuration of the RAID disk drives. 

In addition, each physical disk drive associated with the RAID controller 102 
includes a configuration file 134 that includes data indicating the configuration of 

20 the drive. 

In an altranate embodiment, the second memory device 122 on the controller 
holds only configuration labels which identify the configuration files 134 on the 
physical disk drives, rather than holding the entire configuration information. 

25 

The foregoing description has described the computer system utilizing the 
technology of the present invention. It should be noted that the present invention is 

not ^nstrain^~to~the coiifiguratidn described above and that other coiifigurations 

can be utilized. Attention now turns to a brief overview of the terminology that will 
30 be used to describe a preferred embodiment of the present invention. This 
terminology is explained in the context of the manual configuration procediu^. 
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The manual configuration procedures are used to create logical disk drives 
from an array of physical disk drives. Typically the configuration process is a 
manual procedure that is initiated hy a user. Fig. 3 illustrates the steps used in the 
manual configuration process. First, a user identifies one or more drive groups (step 
5 172), orders the drive groups (stq) 174), and creates and configures one or more 
logical drives in each drive group with a RAID level as well as other parameter 
settings (step 176). The configuration information is stored in each physical drive 
and in the RAID controller (step 178). The logical drives are then initialized (step 
180) and the configuration is presented by the RAID controller 102 to the host 
1 0 operating system (step 1 82). These steps will be described in more detail below. 



Fig. 4 illustrates a mmiber of physical disk drives arranged in one or more 
drive groups 140, 142, 144 (step 172). The physical disk drives are connected to one 
of the SCSI channels 106. The physical disk drives can be arranged into one or 

15 more drive groups 140, 142, 144. A drive group 140, 142, 144 is used to create 
logical drives having a defined capacity, a RAID level, as well as other device 
settings. The edacity is based on the aggregate of the edacities of each of the disk 
drives in the drive group and depends on the RAID level. In a preferred 
embodiment, the RAID controller 102 can support up to eight drive groups. A drive 

20 group can include one to eight physical drives. Drives that are not included in any 
drive group are considered standby or hot spare drives. The standby drive is a 
redimdant disk drive that is used when a disk drive fails. 



As shown in Fig. 4, the disk drives are configured into three drive groups 
25 referred to as drive group A 140, drive group B 142, and drive groiq) C 144 with one 
standby drive 146. Drive group A 140 contains three disk drives located on SCSI 
channel 10.6 A, drive groiq) B 142 includes three disk drives located on SCSI channel 

106B, and dnve group C 144 inclM^two disk drives situated on SGSTchannel 

106A and one disk drive situated on SCSI channel 106B. Disk drive 146 is 
30 considered the hot spare drive. 
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After all the disk groups have been identified, the drive groups are ordered 
(step 174). The drive groups are ordered with a sequential numeric ordering Scorn 1 
to n, where n is the highest order number. The ordering is used for certain operating 
system purposes and in particular to designate a primary logical drive that can serve 
5 as a boot drive. A boot drive is used to *lK>ot-up" the physical drives in a 
configuration which is described in more detail below. 
f\ 

Next, logical drives in each drive group are created and configured (step 
176). A logical or system drive is that portion of a drive group seen by the host 

10 operating system as a single logical device. There can be more than one logical 
drive associated with a particular drive group. A user creates a logical drive by 
indicating the portions of the drive group that will be part of a particular logical 
drive. For example, as shown in Fig. 5, drive group A includes three physical drives 
150, 152, 154 and three logical drives A^, A|, and Aj. Logical drive A^ spans across 

15 a designated portion of each physical drive 150, 152, and 154 in drive group A. 
Similarly, logical drive A, and Aj each span across a designated portion of each 
physical drive 150, 152, and 154 in drive groiq) A. 

Each logical drive within a drive group is ordered. This order is derived 
20 fix)m the manner in which the logical drives are created. The logical drives are 
created based on the order of their respective drive groups. For instance, the first 
logical drive created in the first drive groi^ is considered the first logical drive, the 
second logical drive created in the first drive group is considered the second logical 
drive, and so on. As noted above, the order is used to define the logical drive that 
25 serves as the boot drive. A boot drive is used at system initialization to boot up the 
physical drives in the configuration. The first logical drive (i.e., logical drive^j) is 
considered the boot drive. In addition, the order is used to add disk capacity which 



is discussal in more detail beldwr "As shown in Fig,- 5;-logical-A|,-is considered the 
first logical drive or boot drive of drive group A, logical drive is considered the 
30 second logical drive, and logical drive A3 is considered the third logical drive. 
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Each logical drive is configured by defining a capacity, a cache write policy, 
and a RAID level. The edacity of a logical drive includes any portion of a drive 
group up to the total capacity of that drive group. 

S The RAID controller 102 has a cache memory 126 that is used to increase the 

performance of data retrieval and stomge operations. The cache memory 126 can be 
operated in a write-back or write-through mode. In a write-back mode, write data is 
temporarily stored in the cache 126 and written out to disk at a subsequent time. An 
advantage of this mode is that it increases the controller's performance. The RAID 
10 controller 102 notifies the operating system that the write operation succeeded 
although the write data has not been stored on the disk. However, in the event of a 
system crash or power failure, data in the cache 126 is lost unless a battery backup is 
used. 



15 In write-tiirough mode, write data is written 6om the cache 126 to the disk 

before a completion status is returned to the operating system. This mode is more 
secure since tiie data is not lost in the event of a power failure. However, the write- 
through operation lowers the performance of the controller 102 in many 
enviroimients. 

20 

Each logical drive has a RAID level which is based on the number of drives 
in the drive group in which it is created. In a preferred embodiment, the following 
RAID levels are supported: 
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RAID LEVEL or JBOD 


DESCRIPTION 


RAIDO 


Striping, Requires a minimum of 2 drives and a 
maximum of 8 drives. This RAID level does not 
support redundancy. 


RAIDl 


MuToring. Requires 2 drives. This RAID level 
siqTports redimdancy. 


RAID3 


Requires a minimum of 3 drives and a maximimi of 
8 drives. This RAID level si^ports redundancy. 


RAIDS 


Requires a minimum of 3 drives and a maximum of 
8 drives. This RAID level supports redundancy. 


RAID 0+1 


Combination of RAID 0 (striping) and 
RAID 1 (mirroring). Requires a minimum of 3 
drives and a maximum of 8 drives. This RAID 
level supports redxmdancy. 


Just a Bunch of Disks 
(JBOD) 


Each drive fimctions independently of one another. 
No redundancy is supported. 



10 TABLE I 

Fig. 6 illustrates an exenq>lary assignment of the RAID levels for a particular 
drive group, drive group B. In this example, logical drive Bq spans across three 
physical drives 156, 158, 160 and logical drive B, spans across the same physical 
15 drives 156, 158, 160. Logical drive B© is assigned RAID level 5 and logical drive Bj 
is assigned RAID level 0+1, 

Once the configuration procedure is completed, the configuration for each 
drive group is stored in each physical drive in a configuration file 134 preferably 

20 located in the last 64K bytes of the drive (step 178), The logical drives are then 
initialized (step 180) and the RAID controller 102 presents the configuration to the 

host operating systOTi (step J 82) 



The foregoing description has described the steps that can be used by a user 
25 to manually configure the disk drives coimected to a RAID controller and introduces 
the terminology used in a preferred embodiment of the present invention. Attention 
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now turns to the methods and procedures that are used to automatically configure the 
disk drives connected to a RAID controller. 

There are two procedures that are used to automatically configure the disk 
5 drives 110 connected to a RAID controller 102. A startup configuration procedure 
128 is used when the controller 102 is powered up or started before the operating 
system is operational. A nmtime configuration procedure 1 30 is used to alter the 
configuration at runtime when additional disk drives are connected to the controller 
102. At runtime, the RAID controller 102 is operational and servicing the I/O 
10 activity. 



Fig. 7 illustrates the steps used by the startup configuration procedure 128 to 
create a configuration for the disk drives connected to the controller 102. At power- 
up, the controller 102 scans all the devices 110 connected to it in order to obtain the 
IS physical cq)acity of a disk drive and to obtain the configuration data Scorn the disk 
drive (step 200). 

Fig, 8 illustrates this step (step 200) in fiirther detail. The startup 
corrfigmration procedure 128 scans each SCSI chaimel 106 (step 202) and each 

20 device 1 10 cormected to the SCSI channel 106 (step 204). In a preferred 

embodiment, the startup configuration procedure 128 can issue the SCSI conunand 
TEST READY UNIT to determine if a peripheral cormected to a SCSI chaimel 106 
is powered up and operational. The startup configuration procedure 128 can also 
issue an INQUIRY conmaand to determine tiie type of the peripheral device 1 10. If 

25 the device 1 1 0 is not a disk drive (step 206-N), then the startup configuration 
procedure 128 continues onto the next device 1 10. Otherwise (step 206-Y), the 
startup configuration procedure 128 obtains the capacity of the disk drive (step 208). 

This can be accomplished by^the^st^lS^c^^ 

READ CAPACITY command to the device. The READ CAPACITY command 

30 returns the maximimi logical block address (LBA) for a disk drive which serves as 
an indicator of the capacity of the device 110. 
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Next, the startup configuration procedure 128 attempts to read the 
configuration file 134 stored in the disk drive (step 210). As noted above, the 
configuration file 134 is preferably located at the last 64K block on the disk drive. 
A configuration file 134 is present if the disk has been previously configured. 
5 Otherwise, the configuration file 134 will not exist. The configuration file 134 
contains configuration information such as the drive group identifier, the logical 
drive identifier, the RAID level, as well as other information. 



The startup configuration procedure 128 repeats steps 202 - 210 for each disk 
10 drive 110 located on each channel 106 that is connected to the RAID controller 102. 

Referring back to Fig. 7, the startup configuration procedure 128 then 
analyzes the configuration information (step 212). The startup configuration 
procedure 128 uses the configuration information to determine the validity of each 
1 5 configuration and to determine which devices have not been configured. 

A complete configuration is one where all the physical drives identified in 
the configuration are connected to the controller 102. A partial configuration is one 
where some of the physical drives identified in the configuration are not connected 
to the controller 102. A valid configuration is one that is either a complete 
configuration or a partial configuration where the logical drives are at least in the 
degraded mode. Degraded mode refers to the situation where the following two 
conditions are met Firstly, the logical drives are configured with redundancy (i.e., 
RAID level 1, 3, 5, or (H-1) and secondly, a physical drive is dead or not operational 
but the logical drive can still operate without any data loss. In degraded mode, the 
drive group is fimctioning and all data is available, but the array cannot sustain a 
fiirther drive failure without potential data loss. 



After analyzing the configuration information, the startup configuration 
procedure 128 determines whether there are any partial configurations present (step 
214). As noted above, a partial configuration is one where some of the physical 
drives identified in the configuration are not connected to the controller 102. If so 
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(step 214-Y), the startup configuration procedure 128 performs corrective actions to 
configure the drive group as a valid configuration (step 216). The procedure 128 
will attempt to place the logical drives in the degraded mode (step 216). 

5 When the corrective action cannot be performed (step 21 8-N), the startup 

configuration procedure 128 terminates processing and an appropriate error message 
can be displayed to the user (step 220). In the case where the corrective action is 
successfiil (stq) 218-Y), the startup configuration procedure 128 continues 
processing. 

10 

The startup configuration procedure 128 then determines whether there are 
any unconfigured drives (step 222). An unconfigured drive is one that does not have 
a configuration file 134 associated with it If there are any unconfigured drives, the 
startup configuration procedure 128 configures the drives with the following 
1 5 configurable default parameter settings (step 224): 

Stripe size: The stripe size is the size of the amount of data written on one 
drive before moving to the next drive. The stripe size is used to tune the controller 
performance for a specific environment or ^plication. Typically, a smaller stripe 
20 size provides better performance for random I/O and a larger stripe size provides 
better performance for sequential transfers. 

Cache line size: The cache line size represents the size of the data that is 
read or written. The cache line size is based on the stripe size. 

SCSI transfer rate: The SCSI transfer rate sets the maximum transfer rate 
25 for each drive chaimel. 

Spin-up option: The spin-up option controls how the SCSI drives in the 
array are started. There are two spin-up modes that may be selected: Automatic and 

On PowerrThe Automatic option causes the controller-t^ 

drives, two-at-a-time at six second intervals, until every drive in the array is 
30 spinning. The On Power option assumes that all drives are ahready spinning. 
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Controller read ahead: The controller read ahead option allows the 
controller to read into the cache a full cache line of data at a time. When this option 
is enabled, the percentage of cache hits is improved. 

Automatic add capacity: This option automatically adds edacity to a 
5 logical drive. 

In addition, the startup configuration procedure 128 associates a RAID level 
with the unconfigured drives based on the following parameter settings and the 
number of inserted unconfigured drives (step 224). The parameter settings that are 
1 0 used to affect the determination of the RAID level are as follows: 

Redundancy: This option specifies whether or not data redundancy is 
required. 

Redundancy method: This options specifies one of the two redundancy 
methods: mirroring or parity. Mirroring refers to the 100% duplication of data on 
one disk drive to another disk drive. Parity or rotated XOR redimdancy refers to a 
method of providing complete data redundancy while requiring only a firaction of the 
storage cq)acity of mirroring. For example, in a system configured under RAID 5, 
all data and parity blocks are divided betweai the drives in such a way that if any 
single drive is removed or fiuls, the data on it can be reconstructed using the data on 
the remaining drives. 

Spare disposition: The controller allows for the replacement of failed hard 
disk drives without the interruption of system service. A hot spare is a standby drive 
that is used in the event of a disk &ilure to rebuild the data on a failed disk. 

A RAID level is assigned to the unconfigured drives based on the 
aforementioned parameter settings and the number of inserted unconfigured drives 
in~ac*x)niance with the following rales wWch area 

30 (1) If the number of unconfigured drives = 1, then the unconfigured drive is 

aJBOD. 

(2) If redundancy is not needed, the RAID level is set to RAID 0. 



15 



20 



25 
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(3) If redundancy is not needed and the number of drives = 2, then the 
RAID level is set to RAID 1 . 

(4) If redundancy is not needed, the number of drives >= 3, and a spare is 
needed, then the largest drive is designated as a spare drive and the number of drives 

5 is decremented by 1 , 

(5) If there are only two unconfigured drives at this point, then the RAID 



level is set to RAID 1. 

(6) If the redimdancy method is mirroring, then set the RAID level to 0+1 . 

(7) If the redundancy method is not mirroring, then set the RAID level to 5. 



15 
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Mirroring 


Yes 
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Parity 


Yes 


5 


Yes 


n-1 



25 TABLE II 

Once the unconfigured drives are configured, their configuration is stored in 
each physical drive and the startup configuration procedure 128 continues 
30 processing(step 224). ~^ — 



The startup configuration procedure 128 then determines whether there is 
more than one valid configuration (step 226). In this case (step 226-Y), a new 
configuration is generated which is an aggregation of all the valid configurations and 
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each logical drive is assigned a logical drive order with respect to the aggregated 
configuration (step 228). An existing logical drive order may be associated with 
each valid configuration thereby requiring the generation of a new logical drive 
order for the aggregated configuration. 

Referring to Fig. 9, the startup configuration procedure 128 reads the 
configuration file 132 associated with each logical drive (step 240). In some 
instances, a single configuration file 132 can be associated with several logical 
drives. This is due to the fact that there is a single configuration file 132 for each 
physical drive and several logical drives can be associated with a particular physical 
drive. The configuration file 132 will include a logical drive number representing 
the drive's logical drive order. The logical drive niunbers found in the configuration 
files 132 will be used as the logical drive order for the combined configuration (step 
242). 

However, there may be two or more logical drives with the same logical 
drive number. In this case, the following conflict rules are used in order to ascertain 
which of the conflicting logical drives takes a higher precedence (step 244). For 
example, if there are two drives, each numbered as the first logical drive, one will be 
considered the first logical drive and the other is considered the second logical drive. 

First, the controller's configuration file 132 is searched for the configuration 
label which is a unique identifier for each configuration. If this label matches the 
label associated with one of the conflicting logical drives, the logical drive indicated 
by the label takes precedence over the other conflicting logical drive. 

In the case where the controller's configuration file 132 does not have an 
identifier matching dther of the coiiflicting logical driveSj-logicalorderpr^ 
is given to the logical drive closest to the first SCSI channel and the first SCSI 
) device (i.e., closest to SCSI chaimel 0, and target identifier 0). For example, if the 
first conflicting drive is located at SCSI channel 1 and has target identifier 3 and a 
second conflicting drive is located at SCSI chaimel 0 and has target identifier 4, the 
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second conflicting drive is given precedence since it is closest to SCSI channel 0, 
target identifier 0, 



Referring back to Fig. 9, the startup configuration procedure 128 then 
5 determines whether there is one valid configuration (step 230). In this case (step 
230-Y), the fiill configuration is presented to host operating system (step 232). 

The foregoing description has described the steps that can be used to 
automatically configure the disk drives connected to a RAID controller at system 
1 0 initialization. Attention now turns to the maimer in which disk drives are 
automatically configured during the run-time operation of the controller. 

While the system is operational, a user can connect one or more physical 
drives to a SCSI channel 106. The runtime configuration procedure 130 is used to 

1 5 detect the added drives and to add the drives to the present configuration. A user can 
insert one or more physical drives which can be preconfigured or unconfigured. 
Based on the number of physical drives that are inserted and the present 
configuration of the disk drives, the runtime configuration procedure 130 can use the 
physical drives to replace failed drives, to expand the capacity of existing logical 

20 drives, and to form new logical drives. 

Referring to Fig. 10, a user can insert an additional drive within an insertion 
delay time ^^ch preferably is one minute. Multiple drives can be inserted one at a 
time and the time between subsequent insertions is less than or equal to the insertion 

25 delay. The controller 102 will take notice of the insertions at the end of the insertion 
delay (step 250-Y). The controller 102 receives an interrupt fi^om a subsystem 
indicating that additional drives have been inserted to a SCSI chaimel 106. In an 
alternate embodiment-where the subsystem does not support-automatic^detectionj3f_ 
drive insertions, a user can utilize a utility to perform the notification to the 

30 controller 102. 
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If only one drive is inserted and there is at least one logical drive in the 
degraded mode (step 252-Y), the inserted drive is used as a replacement for the 
failed physical drive and the data from the failed drive is reconstructed onto the 
inserted drive (step 254). As noted above, degraded mode refers to the case where 
5 there exists one or more dead or non-operational disk drives. 



If only one drive is inserted and there are no logical drives in the degraded 
mode (step 256-Y), the inserted drive is xised to add cq)acity to the system in 
accordance with the add edacity rules shown in Fig. 1 1 . 

10 

Briefly, an attempt is made to add the inserted drives to the last logical drive. 
In other words, the caqpacity of the inserted drives is added to the logical drive 
associated with the highest nmneric order which can also be referred to as the last 
logical drive. However, there is a limit to the number of physical drives that can be 
15 grouped into a drive group and associated with a particular RAID level. If the sum 
of the physical drives in the last logical drive and the mmiber of inserted drives is 
less than or equal to 8, the inserted drives are added to the last logical drive. The 
edacity of the inserted drives is added to the edacity of the last logical drive. 

20 If tile sum of the physical drives in the last logical drive and the number of 

inserted drives exceeds 8, the inserted drives will be added so as to maximize the 
number of logical drives having redundancy. Redundancy is seen at RAID levels 1, 
5 and 0+1 . The runtime configuration procedure 130 will attempt to add the inserted 
drive to the last logical drive. If the last logical drive cannot accommodate all the 

25 inserted drives due to ttie constraints of the corresponding RAID level, the procedure 
130 will add as many as possible and form a new logical drive with the remaining 
inserted drives. For example, if ttiere are 6 drives in the last logical drive and 3 

imeTted"diskdrives,~only one of the inserted disk-drives is a^^ 

drive and a new logical drive is formed with the remaining two drives. The new 

30 logical drive is configured with a RAID level 1 supporting redundancy. 
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Referring to Fig. 1 1» the runtime configuration procedure 130 starts with the 
last logical drive (LD^ = LD J (step 270) and determines whether all the inserted 
drives can be added to the last logical drive (step 272). If the sum of the number of 
inserted drives and the number of drives in the last logical drive is less than or equal 
5 to 8 (step 272- Y), the inserted drives are added to the capacity of the last logical 
drive (step 274). The number 8 represents a threshold that is used in an embodiment 
of the present invention. However, it should be noted that the present invention is 
not constrained to this particular value and that others can be used. 



10 Otherwise (step 272-N), if only one drive was inserted (step 276- Y), the 

runtime configuration procedure 130 forms a new logical drive as a JBOD (step 
282). 

If the number of inserted drives is two (step 284-Y), a new logical drive and 
IS drive group is formed with the inserted drives in accordance with the default 
parameter settings as described above in Table n (step 286). 



If the number of inserted drives is between 3 and 8 (step 288-Y), a new 
logical drive and drive group is formed with the inserted drives in accordance with 
20 the default parameter settings as described above in Table n (step 290). 

If the number of inserted drives exceeds eight (step 288-N), one or more 
logical drives and/or drive groups are formed at RAID levels 1 , 5, or 0+1 based on 
the default parameter settings as described above in Table n (step 292). 

25 

Referring back to Fig. 10, die runtime configuration procedure 130 then 
updates the configuration in the configuration file 134 associated with the physical 

drive (step-268)- At a later time, the controller402-presents-the new_configura 

the host operating system (step 268). 



30 
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If multiple unconfigured physical drives are inserted (step 260- Y), the 
runtiine configuration procedure 130 allocates the drives as follows. First, the 
physical drives are used to rebuild any existing dead drives (step 262). If there are 
no dead drives or there are remaining inserted drives after the dead drives have been 
5 replaced, the runtime configuration procedure 130 uses the inserted drives to add 
capacity to existing logical drives in accordance with the above mentioned add 
capacity rules shown in Fig. 1 1 (step 258). 

The runtime configuration procedure 1 30 then updates the configuration in 
10 the configuration file 1 34 associated with the physical drive (step 268). At a later 
time, the controller 102 presents the new configuration to the host operating system 
(step 268), 

In the case where there are single and/or multiple configured drives inserted 
15 into the controller 102 (step 264-Y), the configuration of the inserted drives is 

combined with the existing configuration and the logical drive order is determined as 
described above with respect to Fig. 9 (step 266). The runtime configuration 
procedure 130 then updates the configuration in the configuration file 134 associated 
with the physical drive (step 268). At a later time, the controller 102 presents the 
20 new configuration to the host operating system (step 268). 

The foregoing description has described an automatic configuration 
^paratus and procedures which are used to configure the disk drives connected to a 
RAID controllCT. The configuration procedures can automatically configure a group 

25 of disks to operate at various RAID levels both at runtime and a system 
initialization. The automatic configuration apparatus and procedures are 
advantageous since they eliminate user intervention, knowledge, and time required 

for configuring the controller 
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Alternate Fmhndiments 
The foregoing description, for purposes of explanation, used specific 
nomenclature to provide a thorough understanding of the invention. However, it 
5 will be apparent to one skilled in the art that the specific details are not required in 
order to practice the invention. In other instances, well known circuits and devices 
are shown in block diagram form in order to avoid unnecessary distraction fi-om the 
underlying invention. Thus, the foregoing descriptions of specific embodiments of 
the present invention are presented for purposes of illustration and description. They 

10 are not intended to be exhaustive or to limit the invention to the precise forms 

disclosed, obviously many modifications and variations are possible in view of the 
above teachings. The embodiments were chosen and described in order to best 
explain the principles of the invention and its practical appHcations, to thereby 
enable others skilled in the art to best utilize the invention and various embodiments 

15 with various modifications as are suited to the particular use contemplated. It is 

intended that the scope of the invention be defined by the following Claims and their 
equivalents. 

Fiulher, the method and system described hereinabove is amenable for 
20 execution on various types of executable mediums other than a memory device such 
as a random access memory. Other types of executable mediums can be used, such 
as but not limited to, a computer readable storage medium which can be any 
memory device, conq)act disc, or floppy disk. 



25 
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1 . A computer-implemented method for automatically configuring disk drives 
connected to a controller, the method comprising the steps of: 

5 scaiming the disk drives connected to the controller; 

finding one or more unconfigured disk drives; 

prq>aring a configuration for the imconfigured disk drives in accordance with 
one of a plurality of RAID levels; and 

storing the configuration with the controller. 

10 

2. The method of claim 1 , 

said preparing step fiirther comprising the steps of: 

associating with each unconfigured disk drive one or more parameter 

settings; and 

1 5 configuring the unconfigured disk drives with a RAID level based on 

the nimiber of unconfigured disk drives and the parameter settings. 



3. The method of claim 2, 

wherein the parameter settings comprise the set consisting of redundancy 
20 needed, redundancy method, and spare disposition. 



4. The method of claim 1, fiirther comprising the steps of: 

when only one unconfigured disk drive is found, configuring the 
unconfigured disk drive as a single drive (IBOD); 
25 when the unconfigured disk drives are associated with a parameter setting 

indicating no redimdancy, configuring the unconfigured disk drives in accordance 
with a RAID level 0; 

— : when the unconfigured disk-drives are associated-with a parameter-setting 

indicating mirroring, configuring the unconfigured disk drives in accordance with a 
30 RAID level of 0+1; and 
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when the unconfigured disk drives are associated with a parameter setting 
indicating that mirroring is not required, configuring the unconfigured disk drives in 
accordance with RAID level of 5, 

5 5 . The method of claim 4, fiirther comprising the steps of: 

when the unconfigured disk drives are associated with a parameter setting 
indicating that a spare disk drive is required, 

selecting a largest unconfigured disk drive as the spare disk drive, 
decrementing a number of unconfigured disk drives by the spare disk 

10 drive, and 

configuring the unconfigured disk drives with a RAID level that 
supports the number of unconfigured disk drives in accordance with the parameter 
settings, 

15 6. The method of claim 1 , 
before said storing step, 

finding one or more vaUd configurations of disk drives, and 
combining the valid configurations with the configuration of the 
unconfigured disk drives into a fiill configumtion. 

20 

7. The method of claim 1, 
before said storing step, 

finding a partial configuration, 

converting the partial configuration into a valid configuration, and 
25 combirung the valid configuration with the configuration of the 

unconfigured disk drives into a fiill configuration. 



8: 

30 



The method of claim ly^fiirther comprising the step of: 

presenting the fiill configuration to a host operating system. 
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9. The method of claim 1 , 

detecting at runtime that at least one disk drive was added to the controller, 
determining at runtime that one or more dead drives are connected to the 

controller, and 

S substituting at runtime each dead drive with one of the added disk drives. 



10. The method of claim 9, 

said substituting step fiirther comprising the step of: 

reconstructing data stored on a dead drive onto the substituted added 

10 disk drives. 

1 1 . The method of claim 9, further comprising the steps of: 

adding one or more of the added drives to an existing logical drive; and 
when the added drives cannot be added to an existing logical drive, forming 
1 5 one or more logical drives with the added drives. , ^ ^ ^ 

12. . A computer-implemented method for automatically configuring disk drives 
connected to a RAID controller, the method comprising the steps of: 

detecting at runtime that at least one disk drive was added to the RAID 
20 controller, 

determining that one or more dead drives are cormected to the RAID 
controller, and 

substituting each dead drive with one of the added disk drives. 

25 13, The method of claim 12, further comprising the steps of: 

adding one or more of the added drives to an existing logical drive; and 
when the added drives cannot be added to an existing logical drive, forming 
one or more logical drives with the added drives. 
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14. The method of claim 13, 

said substituting step fiirflier comprising the step of: 

reconstructing data stored on a dead drive onto one of the added disk 

drives. 

5 

15. The method of claim 13, 

said forming step further comprising the step of: 

associating each formed logical drive with one of a plurality of RAID 

levels. 

10 

16. The method of claim 12, further comprising the steps of: 

providing an existing configuration for existing disk drives connected to the 
RAID controller, 

determining that the added disk drives are associated with an added 
15 configuration; and 

forming a new configuration Grom the existing configuration and the added 
configuration. 

1 7. The method of claim 1 6, fiirther comprising the steps of: 

20 wherein the existing configuration includes an existing logical drive order; 

wherein the added configuration includes an added logical drive order; 
generating a new logical drive order for the new configuration using the 
existing configuration and the added configuration. 

25 18. A computer readable storage medium that directs a RAID controller 

connected to disk drives grouped into logical disk drives to function in a specified 

maimer, comprising: 

a startup configuration procediEf^including instnxctions having a c^ 

to generate a configuration for unconfigured disk drives, the unconfigured disk drive 
30 configuration including one or more logical disk drives, each logical disk drive 

configured in accordance with one of a plurality of RAID levels. 
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1 9. The apparatus of claim 1 8, 

the startup configuration procedure including instructions having a capability 
to configure each logical disk drive in accordance with a set of parameter settings 
associated with the logical disk drive and the number of imconfigured disk drives. 

20. The q)paratus of claim 1 8, 

the startup configuration procedure including instructions having a capability 
to generate a fiill configuration from one or more valid configurations, one or more 
partial configurations, and one or more unconfigured disk drive configurations. 

21 . The a^yparatus of claim 1 8, fiirther including: 
a runtime configuration procedure including instructions having a capability 

to configure one or more disk drives added to the RAID controller during runtime. 



15 22. The apparatus of claim 21, 

the runtime configuration procedure including instructions having a 
c^ability to replace each dead drive with an added drive. 

23. The ^paratus of claim 22, 

20 the runtime configuration procedure including instructions having a 

capability to add one or more of the added drives to one or more logical drives. 

24. The apparatus of claim 23, 

the runtime configuration procedure including instructions having a 
25 capability to generate one or more new logical drives including the added drives. 



25. A computer readable storage medium that directs a RAID controller 
cormectedtodisluirives grouped m&aplurality of logical drives to fimction in a- 
specified manner, comprising: 
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a runtime configuration procedure including instractions having a capability 
to configure one or more disk drives added to the RAID controller during runtime, 
the instructions replacing each dead drive with an added drive and adding one or 
more of the added drives to one or more of the logical drives. 

26. The apparatus of claim 25, 

the runtime configuration procedure including instructions having a 
capability to generate one or more new logical drives including the added drives. 



10 27. The apparatus of claim 25, 

wherein each new logical drive is associated with a RAID level selected fi*om 
the set consisting of RAID level 1 , RAID level 5, or RAID level Of 1 . 



28. The apparatus of claim 25, 

15 the runtime configuration procedure including instructions that add one or 

more of the added drives starting from a last logical drive that can accommodate the 
added drives. 

29. The apparatus of claim 25, fiirther comprising: 

20 a starti^ configuration procedure including instructions having a cq)ability 

to generate a configuration for unconfigured disk drives, the unconfigured disk drive 
configuration including one or more logical disk drives, each logical disk drive 
configured in accordance with one of a plurahty of RAID levels. 
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